Deep reinforcement learning of passenger behavior in multimodal journey planning with proportional fairness
نویسندگان
چکیده
Abstract Multimodal transportation systems require an effective journey planner to allocate multiple passengers transport operators. One example is mobility-as-a-service, a new mobility service that integrates various modes through single platform. In such multimodal and diverse planning problem, accommodating heterogeneous with different dynamic preferences can be challenging. Furthermore, may behave based on experiences expectations, in the sense experience affects their state decision of next service. Current methods treating each optimization as non-time varying problem cannot adequately model passenger memories over many journeys time. this paper, we Markov where prior have transient effect future long-term satisfaction retention rate. As such, formulate multi-objective considers individual preferences, experiences, memories. The proposed approach dynamically determines utility weights obtain optimal plan for status. To balance profit received by operator, present variant-based proportional fairness. Our experiments using real-world synthetic datasets show our enhances satisfaction, compared baseline methods. We demonstrate overall increased 2.3 times, resulting higher rate caused levels. facilitate participation operators promote acceptance MaaS.
منابع مشابه
Fairness in Reinforcement Learning
We initiate the study of fairness in reinforcement learning, where the actions of a learning algorithm may affect its environment and future rewards. Our fairness constraint requires that an algorithm never prefers one action over another if the long-term (discounted) reward of choosing the latter action is higher. Our first result is negative: despite the fact that fairness is consistent with ...
متن کاملOperation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملA Decision Support System for Urban Journey Planning in Multimodal Public Transit Network
The goal of this paper is to develop a Decision Support System (DSS) as a journey planner in complex and large multimodal urban network called Rahyar. Rahyar attempts to identify the most desirable itinerary among all feasible alternatives. The desirability of an itinerary is measured by a disutility function, which is defined as a weighted sum of some criteria. The weight...
متن کاملTraining an Interactive Humanoid Robot Using Multimodal Deep Reinforcement Learning
Training robots to perceive, act and communicate using multiple modalities still represents a challenging problem, particularly if robots are expected to learn efficiently from small sets of example interactions. We describe a learning approach as a step in this direction, where we teach a humanoid robot how to play the game of noughts and crosses. Given that multiple multimodal skills can be t...
متن کاملDeep Reinforcement Learning with POMDPs
Recent work has shown that Deep Q-Networks (DQNs) are capable of learning human-level control policies on a variety of different Atari 2600 games [1]. Other work has looked at treating the Atari problem as a partially observable Markov decision process (POMDP) by adding imperfect state information through image flickering [2]. However, these approaches leverage a convolutional network structure...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neural Computing and Applications
سال: 2023
ISSN: ['0941-0643', '1433-3058']
DOI: https://doi.org/10.1007/s00521-023-08733-4